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Abstract — Agarwal et al. gave an closed-form expression for 
write amplification in NAND flasli memory by finding tlie 
probability of a page being valid over the whole flash memory. 
This paper gives an improved analytic expression for write 
amplification in NAND flash memory by finding the probability 
of a page being invalid over the block selected for garbage 
collection. The improved expression uses Lambert W function. 
Through asymptotic analysis, write amplification is shown to 
depend on overprovisioning factor only, consistent with the 
previous work. Comparison with numerical simulations shows 
that the improved expression achieves a more accurate prediction 
of write amplification. For example, when the overprovisioning 
factor is 0.3, the expression proposed by this paper gives a write 
amplification of 2.36 whereas that of the previous work gives 
2.17, when the actual value is 2.35. 

I. Introduction 

Flash memory is a storage medium with growing signifi- 
cance. It has many appealing features including non-volatility, 
small size, low-cost, mechanical reliability, low power con- 
sumption and low read latencies particularly when compared 
to hard disk drives IT] |2]. 

Flash memory is organised in blocks. A block has a fixed 
number of pages (typically 64 pages). A page has a fixed size 
(typically 4KiB) |3|. There are three kinds of operations on 
flash memory: read, write and erase. Read and write operation 
can be performed on a page basis |4|. 

Flash memory has limitations that challenge the design of 
flash memory systems. One fundamental limitation of flash 
memory is nonsupport of overwriting and block erase. After 
data are written into a page, new data can not be written into 
the page by overwriting. The page must be erased before 
new data can be written to it. But erase operation can be 
performed only on a block basis |5|. Another fundamental 
limitation of flash memory is limited endurance. Flash memory 
can tolerate a limited number of program and erase cycles 
before it becomes unreliable. The latest multi-level cell (MLC) 
memories can endure 5000 ^ 10000 program and erase cycles 
m [2|. Thus, it is a critical problem to limit the number of 
program and erase cycles to a minimum to increase the flash 
memory's lifetime. 
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Erasing an entire block when a page is needed to be updated 
is inefficient. Moreover, this can wear out the flash memory 
very quickly. Thus, in flash memory, out-of -place write is used 
|2|. When a page already written needs to be updated, the new 
data are not written into that page but to a new page free for 
writing and the old page is marked as invalid. A mapping table 
is maintained to record the mappings of logical address and 
physical address. In this paper, pages are classified into three 
categories and their definition is given as follows. A free page 
is a page into which no data have been written. It is available 
to accommodate new user writes. A valid page is a page into 
which data have been written but hasn't been updated. A valid 
page is the page that stores the user data. An invalid page is a 
page into which data have been written and has been updated. 
An invalid page once stores user data, but it is updated and 
no longer stores user data due to out-of-place write. 

Out-of-place results in invalid pages. Invalid pages consume 
the flash memory but do not store user data. Thus, when 
the number of invalid pages accumulate to some extent, the 
invalid pages should be reclaimed for new user writes. The 
mechanism in flash memory to reclaim invalid pages and 
translate them into free pages is called garbage collection |2l. 
Garbage collection is performed in the following way: first a 
block is selected for garbage collection. Then, the valid pages 
in the selected block are copied to some other free space in 
the flash memory. After this, the selected block is erased and 
becomes free. Various algorithms for garbage collection have 
been proposed in previous work |6| |7| |8|. 

During garbage collection, valid pages in the selected block 
are copied to some other free space before the selected block 
is erased and copied back to the flash memory afterwards. This 
copy operation causes additional writes. The actual number of 
writes on the flash memory would be more than the number of 
pages needed to be written. This phenomenon is called write 
amplification. Write amplification reduces the flash memory's 
lifetime and therefore should be minimized. In flash memory, a 
common practice is that, user can only use a portion of the raw 
flash memory space. The portion the user cannot use is called 
overprovisioning. Overprovisioning provides flash memory 
with an increased endurance and an improved performance. 
Increasing the amount of overprovisioning decreases write 
amplification. 



Due to the significance of minimizing write amplification, 
previous works analyzed write amplification. Hu et al. first 
developed a probabilistic model to describe the write am- 
plification 13]. They then developed an empirical model to 
compute the write amplification f4|. 

Agarwal et al. gave a closed-form expression for write 
amplification Q. They assumed that initially, the user space 
of the flash memory is full and the valid pages are randomly 
distributed over the physical flash memory. The number of 
valid pages in an arbitrary block is then binomially distributed. 
After a sufficiently large number of user writes, the distribu- 
tion of the number of valid pages in an arbitrary block is 
empirically approximated by a uniform distribution. Because 
both the total number of pages and the total number of valid 
pages remains the same, the average number of valid pages in 
an arbitrary block remains the same. Thus, the expected value 
of these two distributions is equal. By solving this equation, 
an expression for write amplification is obtained which is a 
function of overprovisioning factor. 

This paper follows this work, and proposes an improved 
analytical expression for write amplification in NAND flash 
memory. First, X, the number of invalid pages freed by each 
garbage collection is studied. After a sufficiently large number 
of user writes, X evolves into a stationary state. We assume 
that, in the stationary state, the expected value of X is a 
constant x. Another assumption is made and justified that, X 
has a binomial distribution with probability p\s. However, p\s 
and the expected value of the binomial distribution depend on 
X. Since the expected value of the binomial distribution is x, 
we form an equality which can be solved for x. An asymptotic 
value of X and an asymptotic value of write amplification A are 
obtained using Lambert W function when the total number of 
pages accessible to the user is very large. As with the previous 
work, j4 is a function of overprovisioning factor p. 

The key of both papers is finding the probability of a page 
being invalid or valid. Agarwal et al. find the probability of a 
page being valid over the whole flash memory while this paper 
finds the probability of a page being invalid over the block 
selected for garbage collection. Comparison shows that the 
improved expression achieves a more accurate prediction of 
write amplification than the previous work. For example, when 
the overprovisioning factor is 0.3, the improved expression 
gives a write amplification of 2.36 and that of the previous 
work gives 2.17 whereas the actual value is 2.35. 

The rest of the paper is organized as follows: Section 
specifies the system model assumed. The analysis of write 
amplification and derivation of closed-form expression are 
given in Section Hill In Section HVl the assumptions in Section 
Hm is justified and the comparison between the result of this 
paper and previous work is given. 

II. System Model 

The flash memory is organized in blocks. There are a total 
of T physical blocks in the flash memory. Each block has a 
fixed number of pages, denoted by Np. The portion of the flash 
memory space that user can use is U blocks. Overprovisioning 



factor p is defined by: 
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The flash memory maintains a free block pool and an 
occupied block queue. Blocks that have at least one free page 
are in the free block pool and blocks with no free pages are 
in the occupied block queue. Initially, the flash memory is 
empty and all the blocks are empty and are in the free block 
pool. This paper assumes random writes which are uniformly 
distributed on the user address space. Each write request has 
a fixed size of one page. When a write request comes, data 
are written to a free page in a block in the free block pool. 
If the write request falls on a user address that has not been 
written before, this write request is called a "write". If the 
write request falls on a user address that has already been 
written, this write request is called an "update". When it is 
a write, a free page is found and the data are written into 
that page. The mapping between logical address and physical 
page address is recorded in a mapping table. When it is an 
update, the physical page storing the old data is found through 
the mapping table and marked as invalid. A free page would 
be found to write the new data and the mapping table is 
updated accordingly. Gradually, the blocks in the free block 
queue become full and are moved into the occupied block 
queue. When the number of free pages reaches a predefined 
threshold, garbage collection is triggered to collect the invalid 
pages. Greedy garbage collection is assumed where garbage 
collection is triggered when no free page exists. The block 
with the maximum number of invalid pages is selected for 
garbage collection. The valid pages in the selected block is 
copied to some auxiliary space in the flash memory. Then the 
selected block is erased. Finally the valid pages is copied back 
to this block. This block now joins the free block pool and 
is the only block in the free block pool accommodating write 
requests. 

Suppose there are A^jp invalid pages freed by garbage 
collection. Notice that, copying the valid pages contributes 
-^p ~ ^ip writes on the flash memory. And as a result, A^jp 
free pages are obtained and therefore A"ip new user writes can 
be accommodated. Write amplification. A, is defined as the 
average number of physical page writes per user page write 
[2|. It is given by: 



iVp - iVip + iVjp ^ iVp 



(2) 



Due to the greedy garbage collection policy, at most one 
block can have free pages and be in the free block pool. Thus, 
the flash memory model can be further simplified into a single 
block queue. Blocks are sorted by their time in the queue since 
they were last selected for garbage collection. They are labeled 
0, 1,2, ...,T — 1 from the head of the queue. When no free 
pages exist in any block, the block with the maximum number 
of invalid pages is chosen and removed from the queue. All 
blocks after this block moves one step towards the head of the 
queue. After the erase operation, this selected block is added 



to the tail of the queue with N\p free pages and iVp — iVip vaHd 
pages. In this model, only the last block (block T — 1) can 
have free pages and garbage collection is triggered when this 
block is full. 

A demonstration of this model is shown in Fig. [T] "V" 
stands for valid, "I" for invalid and "F" for free. The demon- 
stration includes four blocks labeled block 0,1,2,3 and each 
block has four pages. The overprovisioning factor is i. This 
means the user can only use 12 pages out of the 16 pages. In 
step (a), there are 12 valid pages in total indicating that the user 
address space is already full and only update can be performed. 
At this time the flash memory is full and garbage collection 
is triggered. Block 1 should be selected for garbage collection 
because it has the maximum number of invalid pages. Block 
1 is removed from the queue and block 2,3 moves one step 
towards the head of the queue becoming block 1,2. The valid 
pages in the selected block shown as blue V's in step (a) are 
copied to some auxiliary space and then the selected block is 
erased and added to the tail of the queue becoming block 3 as 
is shown in step (b). The valid pages are copied into Block 3 
shown as blue V's in step (b). The original two invalid pages 
now becomes free pages accommodating new user writes. Two 
user writes updates the red pages shown in (b). These pages 
are first marked as invalid shown as red I's in (c) and new data 
are written into block 3 shown as red V's in (c). Now garbage 
collection is triggered again. Block should be selected for 
garbage collection which is performed in the same way as in 
step (a). 
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III. Write Amplification Analysis 

For the convenience of explanation later, first the definition 
of a block cycle is given. A block cycle is the process from 
when garbage collection is performed on a block to the time 
when this block is selected again for garbage collection. From 
the point of view of the block queue, a block cycle is the 
process from a block entering the block queue being block 
T — 1 to the time when it is removed from the block queue 
for garbage collection. 

Let X denote the number of invalid pages freed from 
garbage collection. The following assumption was used in ||2| 
and is further supported by simulation in Section |IVl 

Assumption 1: After a sufficiently large number of user 
writes, X evolves into a stationary state and has a constant 
expected value E(X) = x. 

For any block selected for garbage collection, the following 
assumption is given and will be justified later. 

Assumption 2: When a block is selected for garbage col- 
lection, all the pages in this block have the same probability 
Pie to be invalid. 

Due to this assumption, X is a binomial random variable 
with A^'p trials and each trial has a probability to be 
successful. The expected value of X is NpPi^. Therefore, using 
Assumption [T] 

NpP[e = X. (3) 

The following derives pi^. Consider a certain valid page a. It 
corresponds to one page of data in the user space. Because the 
user writes are randomly distributed on the user space and are 
mutually independent and the total user space is U Np pages, a 
user write has a probability of jjjj- to update the data page a 
stores, and make it invalid. After k user writes, page a remains 
valid if none of these k user writes update the data that page 
a stores. Let pvaiid denote the probability that a valid page a 
remains valid after k user writes. 



Pvalid = (1 
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(4) 



Then, pinvaiid, the probabiUty that page a becomes invaUd after 
k user writes: 



Pinvalid — 1 — Pvalid — 1 ^ (1 ^ 
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UN, 



(5) 



Fig. L Demonstration of simplified model 



Let fee denote the total number of user writes during an 
entire block cycle. According to Assumption|2] p\e is the value 
of Pinvalid when k = fcg. Thus, to determine pinvaiid, k = kg 
should be determined. According to Assumption [1] each 
garbage collection collects x invalid pages. This means, after 
each garbage collection, there are x free pages in the block 
T—1 translated from invalid pages freed by garbage collection. 
Then, x user writes fills this block and triggers the next 
garbage collection. Thus, between two garbage collections, 
one block (block T — 1) becomes full and there are x user 
writes. 

Assumption 3: A number of T blocks become full during 
an entire block cycle. 



This assumption will be justified by simulation in Section 
IIVI Using this assumption, 

kg = Tx. (6) 

Apply equation ^ to equation ([5]), Pie, the value of Pinvaiid 
when k — fcg, is: 

1 



Pie = 1 - (1 - 
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Apply equation ^ to equation (O, 

ivp(i-(i-^r)=., 



(7) 



(8) 



where Np,T,U are constants for a specific flash memory. 
Let W(a;) denote the Lambert W function ||9l- Solving this 
equation for x. 
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Thus, write amplification A is. 
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As was mentioned, practical flash memory has a very large 
number of blocks so that UNp 3> 1 . Consider x, the asymp- 
totic value of X as UNp becomes large: 
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The asymptotic write amplification A is: 



(11) 



A = ^ 

X 



-1 



-1- p-\N{{-l- p)e-^-p)' ^^^^ 

For comparison, the closed-form expression for write amplifi- 
cation Apre proposed in the previous work 12J is: 
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(9) Fig. 2. Evolution of tlie number of invalid pages freed by garbage collecdon 



selected for garbage collection. Thus, all the valid pages in 
the beginning of the block cycle have the same probability pie 
to be invalid at the end of the block cycle. The free pages in 
the beginning of the block cycle are written consecutively and 
each of them experience one less user write than the previous 
page. Practical flash memory has a very large number of blocks 
so that Tx ^ 1. Thus, it is reasonable to assume that each 
page in a block has the same probabiUty pie to be invalid when 
selected for garbage collection. 



IV. Simulation 

A flash memory simulator was written and simulations 
were run to justify the assumptions and verify the validity 
of equation (fT2] l. All simulations below start with an empty 
flash memory. 

First, Assumption [T] is justified. Simulations were run with 



Comparison of equation ( fT2] l and equation ( fT3] l shows that, 
the asymptotic write amplification expression obtained by this 
analysis is a function of overprovisioning factor only, which 
is consistent with the previous work. It is independent of the 
number of pages per block. 

Now, Assumption |2] is justified. Consider a certain block, 
at the very beginning of its block cycle with x free pages and 

Np - X vaUd pages. After fee user writes, this block is again Fig- 3. Evolution of the number of invalid pages freed by garbage collecdon 
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Comparison of Results 
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Fig. 4. Approximation of number of blocks becoming full in a block cycle 
using T 



different combinations of U, Np and p. Fig. |2] shows that when 
p = 0.20 and Np — 256, X, the number of invahd pages freed 
by garbage collection, evolves into a stationary state after a 
sufficiently large number of garbage collections. It also shows 
that X converges to the same value, which is close to the 
analytical result 80.31 given by equation ( fTTT i. regardless of 
the value of U. This complies with equation (fTTT i that x is 
independent of U. Fig.[3]shows that X evolves into a stationary 
state regardless of the value of A^p and p as well when the 
number of user writes is sufficiently large. Therefore, after a 
sufficiently large number of user writes, the average value of 
X always converges to a constant x. It also closely approaches 
the analytical value. 

Fig. |4] shows that for different values of U, Np and p, 
T is always a good approximation of the number of blocks 
becoming full in a block cycle. This justifies Assumption |3] 
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Fig. 5. Comparison of results 



Finally, equation ( fT2] ) is evaluated and compared to the 
previous work equation ( fTSl ). From equation (fT2l i. write ampli- 
fication is independent of U and A'^p. Thus, comparison is made 
only for different p values. Simulation is run using Np — 256, 
U — 1024, and p varies from 0.15 to 1.00. The comparison 
of results from simulation, this analysis (equation (fT2] i) and 
previous work (equation (fT3] l) is shown in Fig. |5] and Table 
J] This shows that, the result of equation (fT2] i is much closer 
to the simulation value than the result of previous work. For 
example, when p ~ 0.30 which is a commonly adopted value 
in practical flash memories, equation (fT2] i predicts A = 2.36 
whereas equation (fTsT l gives Aprs = 2.17, when the actual 
value is 2.35. 
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